CDS

Accession Number TCMCG075C20400
gbkey CDS
Protein Id XP_007025711.2
Location join(24107669..24107832,24107917..24107975,24108978..24109034,24109167..24109305,24109451..24109526,24109604..24109662,24109741..24109806,24109888..24109971,24110104..24110168,24110338..24110420,24110499..24110639,24110767..24110907,24111042..24111137)
Gene LOC18596902
GeneID 18596902
Organism Theobroma cacao

Protein

Length 409aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007025649.2
Definition PREDICTED: alpha-galactosidase [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description alpha-galactosidase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01101        [VIEW IN KEGG]
R01103        [VIEW IN KEGG]
R01104        [VIEW IN KEGG]
R01194        [VIEW IN KEGG]
R01329        [VIEW IN KEGG]
R02926        [VIEW IN KEGG]
R03634        [VIEW IN KEGG]
R04019        [VIEW IN KEGG]
R04470        [VIEW IN KEGG]
R05549        [VIEW IN KEGG]
R05961        [VIEW IN KEGG]
R06091        [VIEW IN KEGG]
KEGG_rclass RC00049        [VIEW IN KEGG]
RC00059        [VIEW IN KEGG]
RC00451        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K07407        [VIEW IN KEGG]
EC 3.2.1.22        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00052        [VIEW IN KEGG]
ko00561        [VIEW IN KEGG]
ko00600        [VIEW IN KEGG]
ko00603        [VIEW IN KEGG]
map00052        [VIEW IN KEGG]
map00561        [VIEW IN KEGG]
map00600        [VIEW IN KEGG]
map00603        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGTATGGCAAAGGGATTCAGAGTCAAGTTTCTTTTTGTTGCTCTTCTCAACCTTTGGGTCGTCCACCAGGCGGCTTGTTCAATGAACGTCAGTAGCCATGGACATCAAGCTCACTCTCGGTTCCTTCTGGACAATGGGGTTTCTCGCACTCCGCCGATGGGTTGGAATAGCTGGAATCACTTTCATTGCGATTTAAATGAGACGATTATAAGGAGTACTGCGGACGCTCTTGTTTCAACTGGTTTGGCAAAACTTGGATACAAGTATGTAAATCTTGATGATTGTTGGGCTGAAGGGGAAAGAGACAAGAAGGGTAATTTAAGGGGCAAGCTCACTACCTTTCCATCTGGCATTAAGGCCCTTGCAGATTATGTTCATTCCAAAGGCTTGAAACTTGGGATATATGCTGATGCTGGTAATAGAACCTGCAGTAACCAAATGCCTGGCTCTCTTGGGCATGAAGATCAAGATGCAAGGACTTTTGCTGAATGGGGGGTTGACTACATAAAGTATGACAACTGCTACAATGATGGTTCCAAAAATCGAGGGAGGTATGTGAGGATGAGTCGTGCATTGCAAAAAGCTGGCCGCCCAATCCATTACTCTTTATGTGAATGGGGACAAGAGAAACCAGCAATATGGGCTGGTGCATATGGCCATTCTTGGAGGACTACAGGGGATATTAATGACACCTGGGCAAGTATAACCTCAATTGCAGATGCAAACGATATTTGGGCAAGATATGCTGGACCTGGCGGGTGGAATGATCCTGACATGCTGGAAGTGGGCAATGGAGGAATGACCGTAGAGGAATACCGGTCTCATTTTAGTATTTGGGCTCTTATGAAGGCTCCTCTGCTTCTTGGATGCGATGTTTCATCTGCCAGCAGAGAGACTCTAAGTATCATTGGAAACAAAGAAGTGATAGACATTAATCAGGACCCACTAGGAGTTCAGGGGAGGAAAATACGAACCAAAGGTGGCCTTGAGATTTGGGCAGGGCCATTATCAAGGGGAAGGGTGGTGGTAGTGTTATGGAACAGAAGCCGCGCGAGAGCACCAATCTCCGTGGGATGGAGAGAAATTGGACTCTCTCCTTCTCGTCCTGTCACTGTTAGGGATGTGTGGAAGCACAAATTTGTTGCAATGAAAAGGCGCTATAGATTGACTTCAAGCGTTGCTTCTCACTCTTGTAAGATGTATGTTATGACACCATTTAGTGGATGA
Protein:  
MSMAKGFRVKFLFVALLNLWVVHQAACSMNVSSHGHQAHSRFLLDNGVSRTPPMGWNSWNHFHCDLNETIIRSTADALVSTGLAKLGYKYVNLDDCWAEGERDKKGNLRGKLTTFPSGIKALADYVHSKGLKLGIYADAGNRTCSNQMPGSLGHEDQDARTFAEWGVDYIKYDNCYNDGSKNRGRYVRMSRALQKAGRPIHYSLCEWGQEKPAIWAGAYGHSWRTTGDINDTWASITSIADANDIWARYAGPGGWNDPDMLEVGNGGMTVEEYRSHFSIWALMKAPLLLGCDVSSASRETLSIIGNKEVIDINQDPLGVQGRKIRTKGGLEIWAGPLSRGRVVVVLWNRSRARAPISVGWREIGLSPSRPVTVRDVWKHKFVAMKRRYRLTSSVASHSCKMYVMTPFSG